108 results found.
Written
Corpus,
Language Type:
Monolingual
Languages:
Chinese
Availability:
Freely Available
License:
CC BY-NC-SA 4.0
Size:
90505 analogies OtherProduction Status:
Newly created-finished
Use:
Evaluation/Validation
-
Paper title:CA-EHN: Commonsense Analogy from E-HowNet
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Peng-Hsuan Li | CA-EHN | /N |
Documentation:
A README.md file in the git repository
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
Chinese
Availability:
Freely Available
License:
Size:
170 hoursProduction Status:
Existing-used
Use:
Speech Recognition/Understanding
-
Paper title:Layer Pruning on Demand with Intermediate CTC
-
Paper track:14.12 Non-Autoregressive Sequential Modeling for S/Oral Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Jaesong Lee | AISHELL-1 | /N |
Documentation:
None
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
Arabic Catalan Chinese Dutch Estonian French German Indonesian Italian Japanese Latvian Mongolian Persian Portuguese Russian Slovenian Spanish Swedish Tamil Turkish Welsh
Availability:
Freely Available
License:
CC0
Size:
2880 hoursProduction Status:
Newly created-in progress
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:CoVoST 2 and Massively Multilingual Speech Translation
-
Paper track:12.1 Spoken machine translation/Oral Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Juan Pino | CoVoST 2 | /N |
Documentation:
None
Speech/Written
Corpus,
Language Type:
Bilingual
Languages:
Chinese English
Availability:
From Owner
License:
Size:
57 GByte Production Status:
Newly created-in progress
Use:
Speech Recognition/Understanding
-
Paper title:End-to-End Speech Translation with Knowledge Distillation
-
Paper track:12.1 Spoken machine translation/Poster Presentation
-
Paper status:Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Yuchen Liu | English-Chinese TED | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Multilingual
Languages:
Arabic Chinese English Iberian Slavic
Availability:
LRE 2017
License:
LDC
Size:
57 GByte Production Status:
Existing-used
Use:
Language Identification
-
Paper title:Attention based Hybrid I-vector BLSTM Model for Language Recognition
-
Paper track:4.1 Language identification and verification, lang/Poster Presentation
-
Paper status:Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Anand Mohan | NIST LRE 2017 | /N |
Documentation:
Yes, English, Yes
Speech
Corpus,
Language Type:
Monolingual
Languages:
Chinese
Availability:
Freely Available
License:
Apache License v.2.0
Size:
15 GByte Production Status:
Existing-used
Use:
Speech Recognition/Understanding
-
Paper title:Self-Attention Transducers for End-to-End Speech Recognition
-
Paper track:8.5 Novel neural network architectures (e.g. seque/Oral Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Zhengkun Tian | AISHELL-1 | /N |
Documentation:
http://www.openslr.org/33/
Written
Corpus,
Language Type:
Monolingual
Languages:
Chinese
Availability:
Freely Available
License:
Creative Commons BY-NC-ND 3.0
Size:
1.9 GByte Production Status:
Existing-used
Use:
Language Modelling
-
Paper title:Learn Spelling from Teachers: Transferring Knowledge from Language Models to Sequence-to-Sequence Speech Recognition
-
Paper track:8.6 Neural network training methods (including new/Oral Presentation
-
Paper status:Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Ye Bai | CLMAD | /N |
Documentation:
None
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
Chinese
Availability:
Freely Available
License:
Apache License v.2.0
Size:
15 GByte Production Status:
Existing-used
Use:
Speech Recognition/Understanding
-
Paper title:Learn Spelling from Teachers: Transferring Knowledge from Language Models to Sequence-to-Sequence Speech Recognition
-
Paper track:8.6 Neural network training methods (including new/Oral Presentation
-
Paper status:Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Ye Bai | AISHELL-1 | /N |
Documentation:
None
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
Chinese
Availability:
Freely Available
License:
CC0 1.0 Universal (CC0 1.0) Public Domain Dedication
Size:
10 Languages OtherProduction Status:
Newly created-finished
Use:
Machine Learning
-
Paper title:CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages
-
Paper track:7.16 Tools and data for speech synthesis/Poster Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Thomas Mulc | CSS10 | /N |
Documentation:
Documentation available in English on
Written
Corpus,
Language Type:
Bilingual
Languages:
Chinese English
Availability:
Freely Available
License:
LICENSEE, by Princeton University
Size:
10 MByte Production Status:
Existing-used
Use:
Knowledge Discovery/Representation
-
Paper title:Enhancing Pre-Trained Language Representations with Rich Knowledge for Machine Reading Comprehension
-
Paper track:Long/Question Answering
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Quan Wang | Wordnet | /N |
Documentation:
https://wordnet.princeton.edu/documentation




